SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: WOZNEY, John 

CELESTE r Anthony J. 
THIES, R. Scott 
YAMAJI, Noboru 

(ii) TITLE OF INVENTION: RECEPTOR PROTEINS 

(ill) NUMBER OF SEQUENCES s 19 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Genetics Institute Inc.- Legal Affairs 

(B) STREET: 87 CambridgePark Drive 

(C) CITY: Cambridge 

(D) STATE: MA 

(E) COUNTRY: USA 

(F) ZIP: 02140 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/123,934 

(B) FILING DATE: 17-SEP-1993 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: LAZAR, Steven R 

(B) REGISTRATION NUMBER: 32,618 

(C) REFERENCE/DOCKET NUMBER: 5203 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 617 876 1170 

(B) TELEFAX: 617 876 5851 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1813 base pairs 
(8) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: CFKl-23a 

( ix ) FEATURE : 

(A) NAME /KEY: CDS 

(B) LOCATION: 61. . 1656 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NOilt 



CTAGTGGATC CCCCGGGCTG CAGGAATTCT GCGGCCGCCA GGACACGTGC GAATTGGACA 60 

ATG ACT CAG CTA TAC ACT TAC ATC AGA TTA CTG GGA GCC TGT CTG TTC 108 
Met Thr Gin Leu Tyr Thr Tyr lie Arg Leu Leu Gly Ala Cys Leu Phe 
15 10 15 

ATC ATT TCT CAT GTT CAA GGG CAG AAT CTA GAT AGT ATG CTC CAT GGT 156 
He He Ser Hie Val Gin Gly Gin Asn Leu Asp Ser Met Leu His Gly 
20 25 30 

ACT GGT ATG AAA TCA GAC GTG GAC CAG AAG AAG CCG GAA AAT GGA GTG 204 
Thr Gly Met Lye Ser Asp Val Asp Gin Lys Lys Pro Glu Asn Gly Val 
35 40 45 

ACG TTA GCA CCA GAG GAC ACC TTA CCT TTC TTA AAA TGC TAT TGC TCA 252 
Thr Leu Ala Pro Glu Asp Thr Leu Pro Phe Leu Lys Cys Tyr Cys Ser 
50 55 60 

GGA CAC TGC CCA GAT GAC GCT ATT AAT AAC ACA TGC ATA ACT AAT GGC 300 
Gly His Cys Pro Asp Asp Ala He Asn Asn Thr Cys He Thr Asn Gly 
65 70 75 80 

CAT TGC TTT GCC ATT ATA GAA GAA GAT GAT CAG GGA GAA ACC ACG TTA 348 
His Cys Phe Ala He He Glu Glu Asp Asp Gin Gly Glu Thr Thr Leu 
85 90 95 

ACT TCT GGG TGT ATG AAG TAT GAA GGC TCT GAT TTT CAA TGC AAG GAT 396 
Thr Ser Gly Cys Met Lys Tyr Glu Gly Ser Asp Phe Gin Cys Lys Asp 
100 105 110 

TCA CCA AAA GCC CAG CTA CGC AGG ACA ATA GAA TGT TGT CGG ACC AAT 444 
Ser Pro Lys Ala Gin Leu Arg Arg Thr He Glu Cys Cys Arg Thr Asn 
115 120 125 

TTG TGC AAC CAA TAT TTG CAG CCT ACA CTG CCC CCT GTC GTT ATA GGC 492 
Leu Cys Asn Gin Tyr Leu Gin Pro Thr Leu Pro Pro Val Val He Gly 
130 135 140 

CCA TTC TTT GAT GGC AGC GTC CGA TGG CTG GCT GTG CTC ATC TCT ATG 540 
Pro Phe Phe Asp Gly Ser Val Arg Trp Leu Ala Val Leu lie Ser Met 
145 150 155 160 

GCT GTC TGT ATT GTC GCC ATG ATC GTC TTC TCC AGC TGC TTC TGT TAC 588 
Ala Val Cys He Val Ala Met He Val Phe Ser Ser Cys Phe Cys Tyr 
165 170 175 

AAA CAT TAC TGT AAG AGT ATC TCA AGC AGA GGT CGT TAC AAC CGT GAC 636 
Lys His Tyr Cys Lys Ser He Ser Ser Arg Gly Arg Tyr Asn Arg Asp 
180 185 190 

TTG GAA CAG GAT GAA GCA TTT ATT CCA GTA GGA GAA TCA CTG AAA GAC 684 
Leu Glu Gin Asp Glu Ala Phe He Pro Val Gly Glu Ser Leu Lys Asp 
195 200 205 

CTG ATT GAC CAG TCA CAA AGC TCT GGT AGT GGA TCT GGA TTA CCT TTA 732 
Leu He Asp Gin Ser Gin Ser Ser Gly Ser Gly Ser Gly Leu Pro Leu 
210 215 220 



48 



TTG GTT CAC CGA ACT ATT GCC AAA CAG ATT CAG ATG GTT CGG CAG GTT 
Leu Val Gin Arg Thr He Ala Lys Gin He Gin Met Val Arg Gin Val 
225 230 235 240 
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GGT AAG GGC CGG TAT GGA GAA GTA TGG ATG GGT AAA TGG CGT GGT GAA 828 
Gly Lye Gly Arg Tyr Gly Glu Val Trp Met Gly Lys Trp Arg Gly Glu 
245 250 255 

AAA GTG GCT GTC AAA GTA TTT TTT ACC ACT GAA GAA GCT AGC TGG TTT 876 
Lye Val Ala Val Lys Val Phe Phe Thr Thr Glu Glu Ala Ser Trp Phe 
260 265 270 

AGA GAA ACA GAA ATC TAC CAG ACG GTG TTA ATG CGT CAT GAA AAT ATA 924 
Arg Glu Thr Glu He Tyr Gin Thr Val Leu Met Arg His Glu Asn He 
275 280 285 

CTT GGT TTT ATA GCT GCA GAC ATT AAA GGC ACC GGT TCC TGG ACT CAG 972 
Leu Gly Phe He Ala Ala Asp He Lys Gly Thr Gly Ser Trp Thr Gin 
290 295 300 

CTG TAT TTG ATT ACT GAT TAC CAT GAG AAT GGG TCT CTC TAT GAC TTC 1020 
Leu Tyr Leu He Thr Asp Tyr His Glu Asn Gly Ser Leu Tyr Asp Phe 
305 310 315 320 

CTG AAA TGT GCC ACC CTG GAC ACC AGA GCC CTA CTC AAG TTA GCT TAT 1068 
Leu Lys Cys Ala Thr Leu Asp Thr Arg Ala Leu Leu Lys Leu Ala Tyr 
325 330 335 

TCT GCT GCC TGT GGT CTG TGC CAC CTC CAC ACA GAA ATT TAT GGC ACG 1116 
Ser Ala Ala Cys Gly Leu Cys His Leu His Thr Glu He Tyr Gly Thr 
340 345 350 

CAA GGC AAG CCT GCA ATT GCT CAT CGA GAC CTG AAG AGC AAA AAC ATC 1164 
Gin Gly Lys Pro Ala He Ala His Arg Asp Leu Lys Ser Lys Asn He 
355 360 365 

CTT ATT AAG AAA AAT GGT AGT TGC TGT ATT GCT GAC CTG GGC CTA GCT 1212 
Leu He Lys Lys Asn Gly Ser Cys Cys He Ala Asp Leu Gly Leu Ala 
370 * 375 380 

GTT AAA TTC AAC AGT GAC ACA AAT GAA GTT GAC ATA CCC TTG AAC ACC 1260 
Val Lys Phe Asn Ser Asp Thr Asn Glu Val Asp He Pro Leu Asn Thr 
385 390 395 400 

AGG GTG GGC ACC AGG CGG TAC ATG GCT CCA GAA GTG CTG GAC GAG AGC 1308 
Arg Val Gly Thr Arg Arg Tyr Met Ala Pro Glu Val Leu Asp Glu Ser 
405 410 415 

CTG AGT AAA AAC CAT TTC CAG CCC TAC ATC ATG GCT GAC ATC TAC AGC 1356 
Leu Ser Lys Asn His Phe Gin Pro Tyr He Met Ala Asp He Tyr Ser 
420 425 430 

TTT GGT TTG ATC ATT TGG GAG ATG GCC CGT CGC TGT ATT ACA GGA GGA 1404 
Phe Gly Leu He He Trp Glu Met Ala Arg Arg Cys He Thr Gly Gly 
435 440 445 

ATC GTG GAG GAA TAT CAA TTA CCA TAT TAC AAC ATG GTG CCT AGT GAC 1452 
He Val Glu Glu Tyr Gin Leu Pro Tyr Tyr Asn Met Val Pro Ser Asp 
450 455 460 
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CCA TCT TAT GAA GAC ATG CGT GAG GTC GTG TGT GTG AAA CGC TTG CGG 1500 
Pro Ser Tyr Glu Asp Met Arg Glu Val Val Cys Val Lys Arg Leu Arg 
465 470 475 480 

CCA ATC GTC TCT AAC CGC TGG AAC AGT GAT GAA TGT CTT CGA GCC GTT 1548 
Pro lie Val Ser Asn Arg Trp Asn Ser Asp Glu Cys Leu Arg Ala Val 
485 ^ 490 495 

TTG AAG CTG ATG TCA GAA TGC TGG GCC CAT AAT CCA GCA TCC AGA CTC 1596 
Leu Lye Leu Met Ser Glu Cys Trp Ala His Aen Pro Ala Ser Arg Leu 
500 505 510 

ACA GCT TTG AGA ATC AAG AAG ACG CTC GCA AAG ATG GTT GAA TCC CAG 1644 
Thr Ala Leu Arg lie Lys Lys Thr Leu Ala Lys Met Val Glu Ser Gin 
515 520 525 

GAT GTA AAG ATT TGACAAACAG TTTTGAGAAA GAATTTAGAC TGCAAGAAAT 1696 
Asp Val Lys He 
530 

TCACCCGAGG AAGGGTGGAG TTAGCATGGA CTAGGATGTC GGCTTGGTTT CCAGACTCTC 1756 
TCCTCTACCA TCTTCACAGG CTGCTAACAG TAAACCTTTC AGGACTCTGC AGAATGC 1813 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 532 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear . 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Thr Gin Leu Tyr Thr Tyr He Arg Leu Leu Gly Ala Cys Leu Phe 
1.5 10 15 

He He Ser His Val Gin Gly Gin Asn Leu Asp Ser Met Leu Hie Gly 
20 25 30 

Thr Gly Met Lys Ser Asp Val Asp Gin Lys Lys Pro Glu Asn Gly Val 
35 40 45 

Thr Leu Ala Pro Glu Asp Thr Leu Pro Phe Leu Lys Cys Tyr Cys Ser 
50 55 60 

Gly His Cys Pro Asp Asp Ala He Asn Asn Thr Cys He Thr Asn Gly 
65 70 75 80 

His Cys Phe Ala He He Glu Glu Asp Asp Gin Gly Glu Thr Thr Leu 
85 90 95 

Thr Ser Gly Cys Met Lys Tyr Glu Gly Ser Asp Phe Gin Cys Lys Asp 
100 105 110 

Ser Pro Lys Ala Gin Leu Arg Arg Thr He Glu Cys Cys Arg Thr Asn 
115 120 125 

Leu Cys Asn Gin Tyr Leu Gin Pro Thr Leu Pro Pro Val Val He Gly 
130 135 140 
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Pro Phe Phe Asp Gly Ser Val Arg Trp Leu Ala Val Leu He Ser Met 
145 150 155 160 

Ala Val Cye He Val Ala Met He Val Phe Ser Ser Cys Phe Cye Tyr 
165 170 175 

Lye Hie Tyr Cye Lys Ser He Ser Ser Arg Gly Arg Tyr Asn Arg Asp 
180 185 190 

Leu Glu Gin Asp Glu Ala Phe He Pro Val Gly Glu Ser Leu Lys Asp 
195 200 205 

Leu He Asp Gin Ser Gin Ser Ser Gly Ser Gly Ser Gly Leu Pro Leu 
210 215 220 

Leu Val Gin Arg Thr He Ala Lys Gin He Gin Met Val Arg Gin Val 
225 230 235 240 

Gly Lys Gly Arg Tyr Gly Glu Val Trp Met Gly Lys Trp Arg Gly Glu 
245 250 255 

Lys Val Ala Val Lys Val Phe Phe Thr Thr Glu Glu Ala Ser Trp Phe 
260 265 270 

Arg Glu Thr Glu He Tyr Gin Thr Val Leu Met Arg His Glu Asn He 
275 280 285 

Leu Gly Phe He Ala Ala Asp He Lys Gly Thr Gly Ser Trp Thr Gin 
290 295 300 

Leu Tyr Leu He Thr Asp Tyr His Glu Asn Gly Ser Leu Tyr Asp Phe 
305 310 315 320 

Leu Lys Cye Ala Thr Leu Asp Thr Arg Ala Leu Leu Lys Leu Ala Tyr 
325 330 335 

Ser Ala Ala Cys Gly Leu Cys His Leu His Thr Glu He Tyr Gly Thr 
340 345 350 

Gin Gly Lys Pro Ala He Ala His Arg Asp Leu Lys Ser Lys Asn He 
355 360 365 

Leu He Lys Lys Asn Gly Ser Cys Cys He Ala Asp Leu Gly Leu Ala 
370 375 380 

Val Lys Phe Asn Ser Asp Thr Asn Glu Val Asp He Pro Leu Asn Thr 
385 390 395 400 

Arg Val Gly Thr Arg Arg Tyr Met Ala Pro Glu Val Leu Asp Glu Ser 
405 410 415 

Leu Ser Lys Asn His Phe Gin Pro Tyr He Met Ala Asp He Tyr Ser 
420 425 430 

Phe Gly Leu He He Trp Glu Met Ala Arg Arg Cys He Thr Gly Gly 
435 440 445 

He Val Glu Glu Tyr Gin Leu Pro Tyr Tyr Asn Met Val Pro Ser Asp 
450 455 460 



Pro Ser Tyr Glu Asp Met Arg Glu Val Val Cys Val Lys Arg Leu Arg 
465 470 475 480 
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Pro He Val Ser Asn Arg Trp Asn Ser Asp Glu Cys Leu Arg Ala Val 
485 490 495 

Leu Lye Leu Met Ser Glu Cys Trp Ala His Asn Pro Ala Ser Arg Leu 
500 505 510 

Thr Ala Leu Arg He Lys Lys Thr Leu Ala Lys Met Val Glu Ser Gin 
515 520 525 

Asp Val Lys He 
530 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2076 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: CFKl-43a 

( ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 247.. 1752 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GCGGCCGCGC CGGCGTGGTG CTCGGAGTGC GGGCGCCGAG GACCCGGGAC CAGGGGCGCG 60 

GCGGCGGGTT GGAGTTCAAG GTACTCGTTA CGTGTGACGA GGAAGTGAAG CCCATTCCAT 120 

GCCTTGCTGA GAAAGGTTCA AACTTCGGCT GAATCACAAC CATTTGGCGC TGAGCTATGA 180 

CAAGAGAGCA AACAAAAAGT TAAAGGAGCA ACTCGGCCAT AAGTGACAGA GAAGTTCGTT 240 

GATAAC ATG CTC TTA CGA AGC TCT GGA AAA TTA AAT GTG GGC ACC AAG 288 
Met Leu Leu Arg Ser Ser Gly Lys Leu Asn Val Gly Thr Lys 
15 10 

AAG GAG GAT GGT GAG AGT ACA GCC CCC ACT GCT CGG CCC AAG GTC CTG 336 
Lys Glu Asp Gly Glu Ser Thr Ala Pro Thr Ala Arg Pro Lys Val Leu 
15 20 25 30 

CGT TGT AAA TGC CAC CAC CAC TGT CCT GAA GAC TCA GTC AAC AAT ATC 384 
Arg Cys Lys Cys His His His Cys Pro Glu Asp Ser Val Asn Asn He 
35 40 45 

TGC AGC ACA GAT GGG TAC TGC TTC ACG ATG ATA GAA GAA GAC GAC TCT 432 
Cys Ser Thr Asp Gly Tyr Cys Phe Thr Met He Glu Glu Asp Asp Ser 
50 55 60 

GGA ACG CCT GTT GTC ACC TCC GGA TGC CTA GGA CTA GAA GGG TCA GAT 480 
Gly Thr Pro Val Val Thr Ser Gly Cys Leu Gly Leu Glu Gly Ser Asp 
65 70 75 
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TTT CAA TGT CGC GAC ACG CCC ATC CCT CAT CAG AGA AGG TCA ATT GAA 528 
Phe Gin Cys Arg Asp Thr Pro lie Pro His Gin Arg Arg Ser lie Glu 
80 85 90 

TGC TGC ACA GAA AGG AAC GAA TGT AAT AAA GAT CTC CAC CCC ACG CTG 576 
Cys Cys Thr Glu Arg Asn Glu Cys Asn Lys Asp Leu His Pro Thr Leu 
95 100 105 110 

CCT CCC CTG AAG GAC AGA GAT TTT GTT GAT GGA CCC ATA CAC CAC AAA 624 
Pro Pro Leu Lys Asp Arg Asp Phe Val Asp Gly Pro lie His His Lys 
115 120 125 

GCC TTA CTC ATA TCT GTG ACT GTC TGT AGT TTA CTC TTG GTC CTC ATT 672 
Ala Leu Leu lie Ser Val Thr Val Cys Ser Leu Leu Leu Val Leu He 
130 135 140 

ATT TTA TTC TGT TAC TTC AGG TAT AAA AGA CAA GAA GCC AGA CCT CGG 720 
He Leu Phe Cys Tyr Phe Arg Tyr Lys Arg Gin Glu Ala Arg Pro Arg 
145 150 155 

TAC AGC ATT GGG CTG GAG CAG GAT GAA ACG TAC ATT CCT CCT GGA GAA 768 
Tyr Ser He Gly Leu Glu Gin Asp Glu Thr Tyr He Pro Pro Gly Glu 
160 * 165 170 

TCC CTG AGA GAC TTG ATT GAG .CAA TCG CAG AGC TCG GGA AGT GGC TCA 816 
Ser Leu Arg Asp Leu He Glu Gin Ser Gin Ser Ser Gly Ser Gly Ser 
175 180 185 190 

GGA CTC CCT CTG CTG GTC CAA AGG ACA ATA GCT AAG CAA ATT CAG ATG 864 
Gly Leu Pro Leu Leu Val Gin Arg Thr He Ala Lys Gin He Gin Met 
195 200 205 

GTG AAG CAA ATT GGA AAA GGT CGC TAT GGC GAA GTG TGG ATG GGA AAG 912 
Val Lys Gin He Gly Lys Gly Arg Tyr Gly Glu Val Trp Met Gly Lys 
210 215 220 

TGG CGT GGA GAA AAG GTA GCT GTG AAA GTG TTC TTC ACC ACG GAG GAA 960 
Trp Arg Gly Glu Lys Val Ala Val Lys Val Phe Phe Thr Thr Glu Glu 
225 230 235 

GCC AGC TGG TTC CGA GAG ACT GAG ATA TAT CAG ACG GTC CTG ATG AGG 1008 
Ala Ser Trp Phe Arg Glu Thr Glu He Tyr Gin Thr Val Leu Met Arg 
240 245 250 

CAC GAG AAC ATT CTG GGG TTC ATT GCA GCA GAT ATC AAA GGG ACT GGG 1056 
His Glu Asn He Leu Gly Phe He Ala Ala Asp He Lys Gly Thr Gly 
255 260 265 270 

TCT TGG ACT CAG TTA TAC CTC ATC ACA GAC TAT CAT GAA AAC GGG TCT 1104 
Ser Trp Thr Gin Leu Tyr Leu He Thr Asp Tyr His Glu Asn Gly Ser 
275 280 285 

CTT TAT GAC TAT CTG AAA TCC ACC ACC TTA GAT GCC AAG TCC ATG CTG 1152 
Leu Tyr Asp Tyr Leu Lye Ser Thr Thr Leu Asp Ala Lys Ser Met Leu 
290 295 300 

AAG CTA GCC TAC TCG TCT GTC AGC GGC CTG TGC CAT CTA CAC ACG GAA 1200 
Lys Leu Ala Tyr Ser Ser Val Ser Gly Leu Cys His Leu His Thr Glu 
305 310 315 
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ATC TTC AGC ACT CAA GGC AAG CCA GCC ATT GCC CAT CGG GAC TTG AAA 1248 
lie Phe Ser Thr Gin Gly Lye Pro Ala He Ala Hie Arg Asp Leu Lys 
320 325 330 

AGT AAA AAC ATC CTG GTG AAG AAA AAT GGA ACT TGC TGC ATA GCA GAC 1296 
Ser Lys Asn He Leu Val Lys Lys Asn Gly Thr Cys Cys He Ala Asp 
335 340 345 350 

CTG GGC CTG GCT GTC AAG TTC ATT AGT GAC ACA AAT GAG GTT GAC ATT 1344 
Leu Gly Leu Ala Val Lys Phe He Ser Asp Thr Asn Glu Val Asp He 
355 360 365 

CCA CCC AAC ACC CGG GTT GGC ACC AAG CGC TAT ATG CCT CCA GAA GTG 1392 
Pro Pro Asn Thr Arg Val Gly Thr Lys Arg Tyr Met Pro Pro Glu Val 
370 375 380 

CTG GAC GAG AGC TTG AAT AGA ACT CAT TTC CAG TCC TAC ATC ATG GCT 1440 
Leu Asp Glu Ser Leu Asn Arg Thr His Phe Gin Ser Tyr He Met Ala 
385 390 395 

GAC ATG TAC AGC TTT GGA CTC ATC CTC TGG GAG ATT GCA AGG AGA TGT 1488 
Asp Met Tyr Ser Phe Gly Leu He Leu Trp Glu He Ala Arg Arg Cys 
400 405 410 

GTT TCT GGA GGT ATA GTG GAA GAA TAC CAG CTT CCA TAT CAC GAC CTG 1536 
Val Ser Gly Gly He Val Glu Glu Tyr Gin Leu Pro Tyr His Asp Leu 
415 420 425 430 

GTG CCC AGT GAC CCC TCT TAT GAG GAC ATG AGA GAA ATT GTG TGT ATG 1584 
Val Pro Ser Asp Pro Ser Tyr Glu Asp Met Arg Glu He Val Cys Met 
435 440 445 

AAG AAG TTA CGG CCT TCA TTC CCC AAT CGA TGG AGC AGT GAC GAG TGC 1632 
Lys Lys Leu Arg Pro Ser Phe Pro Asn Arg Trp Ser Ser Asp Glu Cys 
450 455 460 

CTC AGG CAA ATG GGG AAG CTT ATG ACA GAG TGC TGG GCG CAT AAT CCT 1680 
Leu Arg Gin Met Gly Lys Leu Met Thr Glu Cys Trp Ala His Asn Pro 
465 470 475 

GCC TCC AGG CTG ACG GCC CTG AGA GTT AAG AAA ACA CTT GCC AAA ATG 1728 
Ala Ser Arg Leu Thr Ala Leu Arg Val Lys Lys Thr Leu Ala Lys Met 
480 485 490 

TCA GAG TCC CAG GAC ATT AAA CTC TGACGTCAGG TACTTGTGGA CAGAGCAAGG 1782 
Ser Glu Ser Gin Asp He Lys Leu 
495 500 

AATTACACAG AAGCATCCTT AGCCCAAGCC TTGAACGTTG ATCTACTGCC CAGTGAGTTC 1842 

AGACTTTCCT CT AAG AG AGC AAGCTGGACA GACACAGAGG AACCCAGAAA CACGGCTTCA 1902 

CCATGGCTTT CTGAGGAGGG GAAACCATTT GGGTAACTTG TTCAAGATAT GATGCATGTT 1962 

GCTTTCTAAG AAAGCCCTGT ATTTTGGGAT TACCATTTTT TTTAAAGAAG AAAGATACTT 2022 

TAATTTTTAC CAAAATAAAA CAAATATTAT AGAAAAAAAG CGGCCGCAGA ATTC 2076 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 502 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Leu Leu Arg Ser Ser Gly Lys Leu Asn Val Gly Thr Lys Lys Glu 
15 10 15 

Asp Gly Glu Ser Thr Ala Pro Thr Ala Arg Pro Lys Val Leu Arg Cys 
20 25 30 

Lys Cys His His His Cys Pro Glu Asp Ser Val Asn Asn lie Cys Ser 
35 40 45 

Thr Asp Gly Tyr Cys Phe Thr Met He Glu Glu Asp Asp Ser Gly Thr 
50 55 60 

Pro Val Val Thr Ser Gly Cys Leu Gly Leu Glu Gly Ser Asp Phe Gin 
65 70 75 80 

Cys Arg Asp Thr Pro He Pro His Gin Arg Arg Ser He Glu Cys Cys 
85 90 95 

Thr Glu Arg Asn Glu Cys Asn Lys Asp Leu His Pro Thr Leu Pro Pro 
100 105 110 

Leu Lys Asp Arg Asp Phe Val Asp Gly Pro He His His Lys Ala Leu 
115 120 125 

Leu He Ser Val Thr Val Cys Ser Leu Leu Leu Val Leu He He Leu 
130 135 140 

Phe Cys Tyr Phe Arg Tyr Lys Arg Gin Glu Ala Arg Pro Arg Tyr Ser 
145 150 155 160 

He Gly Leu Glu Gin Asp Glu Thr Tyr He Pro Pro Gly Glu Ser Leu 
165 170 175 

Arg Asp Leu He Glu Gin Ser Gin Ser Ser Gly Ser Gly Ser Gly Leu 
180 185 190 

Pro Leu Leu Val Gin Arg Thr He Ala Lys Gin He Gin Met Val Lys 
195 200 205 

Gin He Gly Lye Gly Arg Tyr Gly Glu Val Trp Met Gly Lys Trp Arg 
210 215 220 

Gly Glu Lys Val Ala Val Lys Val Phe Phe Thr Thr Glu Glu Ala Ser 
225 230 235 240 

Trp Phe Arg Glu Thr Glu He Tyr Gin Thr Val Leu Met Arg His Glu 
245 250 255 

Asn He Leu Gly Phe He Ala Ala Asp He Lys Gly Thr Gly Ser Trp 
260 265 270 

Thr Gin Leu Tyr Leu He Thr Asp Tyr His Glu Asn Gly Ser Leu Tyr 
275 280 285 
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Asp Tyr Leu Lys Ser Thr Thr Leu Asp Ala Lys Ser Met Leu Lys Leu 
290 295 300 

Ala Tyr Ser Ser Val Ser Gly Leu Cys His Leu His Thr Glu lie Phe 
305 310 315 320 

Ser Thr Gin Gly Lys Pro Ala lie Ala His Arg Asp Leu Lys Ser Lys 
325 330 335 

Asn lie Leu Val Lys Lys Asn Gly Thr Cys Cys lie Ala Asp Leu Gly 
340 345 350 

Leu Ala Val Lys Phe lie Ser Asp Thr Asn Glu Val Asp He Pro Pro 
355 360 365 

Asn Thr Arg Val Gly Thr Lys Arg Tyr Met Pro Pro Glu Val Leu Asp 
370 375 380 

Glu Ser Leu Asn Arg Thr His Phe Gin Ser Tyr He Met Ala Asp Met 
385 390 395 400 

Tyr Ser Phe Gly Leu He Leu Trp Glu He Ala Arg Arg Cys Val Ser 
405 410 415 

Gly Gly He Val Glu Glu Tyr Gin Leu Pro Tyr His Asp Leu Val Pro 
420 425 430 

Ser Asp Pro Ser Tyr Glu Asp Met Arg Glu He Val Cys Met Lys Lys 
435 440 445 

Leu Arg Pro Ser Phe Pro Asn Arg Trp Ser Ser Asp Glu Cys Leu Arg 
450 455 460 

Gin Met Gly Lys Leu Met Thr Glu Cys Trp Ala His Asn Pro Ala Ser 
465 470 475 480 

Arg Leu Thr Ala Leu Arg Val Lys Lys Thr Leu Ala Lys Met Ser Glu 
485 490 495 

Ser Gin Asp He Lys Leu 
500 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3238 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: CFKl-lOa 

( ix ) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 474.. 2000 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GAATTCTGCG GCCGCGAGGC TGCATTAAGT GGGATATGCC ACCCGTGATT CTGACAGCCG 60 

TGACTGCGTG GAGCCTGCTC CGGAACTCTC CACAGAGGAG CAAAGGAGCT GCCCTCTGTG 120 

TCTCCCCGCC CTTCAGCGAG AGTCTGGAAA GAGAACCGAG GTGCTACTGC AGTGGATGAG 180 

TAGAGAAGAG TCTGCATCCA GTGCTGGTGA GCTTGTCTGG CTATAGGGAG CCTGCTGGGG 240 

GAAACTTACA GCTTCAGAAG ACTCCTGGAG AGCCTCTCCC TCCACACTCT CCCTTTGAGC 300 

AGTCAGTGCC TCTCTGCTGG AGAACCTGTG CTGGGTGTGC CCCAGAGCTG GCTTTGACTG 360 

TAGCCTGTCA GGCTCTCCCT GGACCTCACG GAACAGCATT GCCAGCCACA CGGCTTCCAA 420 

CAAATCACCT CTTTTCATGC TGTTTGGCAC AGATCGAATC TACAGGTTAT ACA ATG 476 

Met 
1 

GTC GAT GGA GCA ATG ATC CTT TCT GTG CTA ATG ATG ATG GCT CTC CCT 524 
Val Asp Gly Ala Met lie Leu Ser Val Leu Met Met Met Ala Leu Pro 
5 10 15 

TCC CCG AGT ATG GAA GAT GAG GAG CCC AAG GTC AAC CCG AAG CTT TAC 572 
Ser Pro Ser Met Glu Asp Glu Glu Pro Lys Val Asn Pro Lys Leu Tyr 
20 25 30 

ATG TGT GTG TGT GAG GGC CTC TCC TGC GGG AAC GAG GAC CAC TGT GAG 620 
Met Cys Val Cys Glu Gly Leu Ser Cys Gly Asn Glu Asp His Cys Glu 
35 40 45 

GGC CAG CAG TGT TTT TCC TCC CTG AGC GTC AAT GAT GGC TTC CGC GTC 668 
Gly Gin Gin Cys Phe Ser Ser Leu Ser Val Asn Asp Gly Phe Arg Val 
50 55 60 65 

TAC CAG AAG GGC TGC TTT CAG GTC TAT GAG CAG GGG AAG ATG ACG TGT 716 
Tyr Gin Lys Gly Cys Phe Gin Val Tyr Glu Gin Gly Lys Met Thr Cys 
70 75 80 

AAG ACC CCG CCG TCG CCT GGC CAG GCT GTG GAG TGC TGC CAA GGG GAC 764 
Lys Thr Pro Pro Ser Pro Gly Gin Ala Val Glu Cys Cys Gin Gly Asp 
85 90 95 

TGG TGC AAC AGG AAC GTC ACG GCC CGG CTG CCC ACT AAA GGG AAA TCC 812 
Trp Cys Asn Arg Asn Val Thr Ala Arg Leu Pro Thr Lys Gly Lys Ser 
100 105 110 

TTC CCT GGA TCG CAG AAC TTC CAC CTG GAA GTT GGC CTT ATC ATC CTC 860 
Phe Pro Gly Ser Gin Asn Phe His Leu Glu Val Gly Leu lie lie Leu 
115 120 125 

TCC GTG GTG TTT GCG GTA TGC CTT TTC GCT TGC ATC CTT GGC GTT GCT 908 
Ser Val Val Phe Ala Val Cys Leu Phe Ala Cys He Leu Gly Val Ala 
130 135 140 145 

CTC AGG AAG TTT AAA AGG CGC AAT CAA GAG CGC CTG AAC CCC AGA GAC 956 
Leu Arg Lys Phe Lys Arg Arg Asn Gin Glu Arg Leu Asn Pro Arg Asp 
150 155 160 
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GTG GAG TAC GGT ACT ATC GAA GGG CTC ATC ACC ACC AAC GTC GGA GAT 1004 
Val Glu Tyr Gly Thr lie Glu Gly Leu He Thr Thr Asn Val Gly Asp 
165 170 175 

AGC ACT CTA GCG GAA TTA CTA GAT CAC TCA TGT ACA TCA GGA AGT GGC 1052 
Ser Thr Leu Ala Glu Leu Leu Asp His Ser Cys Thr Ser Gly Ser Gly 
180 185 190 

TCC GGT CTT CCT TTT CTG GTA CAG AGA ACT GTG GCT CGA CAG ATA ACC 1100 
Ser Gly Leu Pro Phe Leu Val Gin Arg Thr Val Ala Arg Gin He Thr 
195 200 205 

CTG TTG GAG TGT GTC GGG AAG GGC CGG TAT GGA GAA GTG TGG AGG GGC 1148 
Leu Leu Glu Cys Val Gly Lye Gly Arg Tyr Gly Glu Val Trp Arg Gly 
210 215 220 225 

AGC TGG CAA GGC GAA AAT GTT GCT GTG AAG ATC TTC TCC TCC CGT GAT 1196 
Ser Trp Gin Gly Glu Asn Val Ala Val Lys He Phe Ser Ser Arg Asp 
230 235 240 

GAG AAG TCG TGG TTC AGG GAG ACA GAA TTG TAC AAC ACG GTG ATG CTG 1244 
Glu Lys Ser Trp Phe Arg Glu Thr Glu Leu Tyr Asn Thr Val Met Leu 
245 250 255 

AGG CAT GAG AAT ATC TTA GGT TTC ATT GCT TCA GAC ATG ACC TCT AGA 1292 
Arg His Glu Asn He Leu Gly Phe He Ala Ser Asp Met Thr Ser Arg 
260 265 270 

CAC TCC AGT ACC CAG CTG TGG CTC ATT ACA CAT TAC CAC GAA ATG GGA 1340 
His Ser Ser Thr Gin Leu Trp Leu He Thr His Tyr His Glu Met Gly 
275 280 285 

TCG TTG TAT GAC TAC CTT CAG CTC ACC ACT CTG GAC ACC GTT AGC TGC 1388 
Ser Leu Tyr Asp Tyr Leu Gin Leu Thr Thr Leu Asp Thr Val Ser Cys 
290 295 300 305 

CTT CGG ATC GTG TTG TCC ATA GCC AGC GGC CTT GCA CAC TTG CAC ATA 1436 
Leu Arg He Val Leu Ser He Ala Ser Gly Leu Ala His Leu His He 
310 315 320 

GAG ATA TTT GGG ACC CAG GGG AAG TCT GCC ATC GCC CAC CGA GAT CTA 1484 
Glu He Phe Gly Thr Gin Gly Lys Ser Ala He Ala His Arg Asp Leu 
325 330 335 

AAG AGC AAA AAC ATC CTC GTG AAG AAG AAC GGA CAG TGC TGC ATA GCA 1532 
Lys Ser Lys Asn He Leu Val Lys Lys Asn Gly Gin Cys Cys He Ala 
340 345 350 

GAT TTG GGC CTG GCA GTC ATG CAT TCC CAG AGC ACG AAT CAG CTT GAT 1580 
Asp Leu Gly Leu Ala Val Met His Ser Gin Ser Thr Asn Gin Leu Asp 
355 360 365 

GTG GGA AAC AAC CCC CGT GTG GGG ACC AAG CGC TAC ATG GCC CCT GAA 1628 
Val Gly Asn Asn Pro Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu 
370 375 380 385 

GTG CTT GAT GAA ACC ATC CAA GTG GAT TGC TTT GAT TCT TAT AAG AGG 1676 
Val Leu Asp Glu Thr He Gin Val Asp Cys Phe Asp Ser Tyr Lys Arg 
390 395 400 
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GTC CAT ATT TGG GCC TTT GGC CTC GTT CTG TGG GAA GTG GCC AGG AGG 1724 
Val Asp lie Trp Ala Phe Gly Leu Val Leu Trp Glu Val Ala Arg Arg 
405 410 415 

ATG GTG AGC AAT GGT ATA GTG GAA GAT TAC AAG CCA CCA TTC TAT GAT 1772 
Met Val Ser Asn Gly lie Val Glu Asp Tyr Lys Pro Pro Phe Tyr Asp 
420 425 430 

GTT GTT CCC AAT GAC CCA AGT TTT GAA GAT ATG AGG AAA GTT GTC TGT 1820 
Val Val Pro Asn Asp Pro Ser Phe Glu Asp Met Arg Lys Val Val Cys 
435 440 445 

GTG GAT CAA CAG AGG CCA AAC ATA CCT AAC AGA TGG TTC TCA GAC CCG 1868 
Val Asp Gin Gin Arg Pro Asn lie Pro Asn Arg Trp Phe Ser Asp Pro 
450 455 460 465 

ACA TTA ACT TCT CTG GCG AAG CTG ATG AAA GAA TGC TGG TAC CAG AAC 1916 
Thr Leu Thr Ser Leu Ala Lye Leu Met Lys Glu Cys Trp Tyr Gin Asn 
470 475 480 

CCA TCC GCC AGA CTC ACA GCT CTA CGT ATC AAA AAG ACT TTG ACC AAA 1964 
Pro Ser Ala Arg Leu Thr Ala Leu Arg lie Lys Lys Thr Leu Thr Lys 
485 490 * 495 

ATT GAT AAC TCC CTA GAC AAA TTA AAA ACT GAC TGT TGACATTGTC 2010 
lie Asp Asn Ser Leu Asp Lys Leu Lys Thr Asp Cys 
500 505 



ACCGGTGTCA 


AGAAGGAGAG 


TCAATGCTGT 


CATTGTCCAG 


CTGGGACCTA 


ATGCTGGCCT 


2070 


GACTGGTTGT 


CAGAACAGAA 


TCCATCTGTC 


CCCCTCTCCC 


CCCAACTCCC 


G AAG TGG CTG 


2130 


CTTTGACAAA 


AGCAGATGTC 


TCTTCCCAGC 


CATGTTCCGG 


GGGAGACACC 


AAAACCACCC 


2190 


TAACCTCGCT 


CAGAAACTGT 


GACTCGAGCA 


CTTGATGAAC 


TGTTCACACC 


GCAAAGACTA 


2250 


ACGGTGGGCA 


GGTATGTTTG 


CAAGGGGGAG 


GGAAGTGGAG 


GAG CA CAG AG 


AGATCCTGCA 


2310 


GGAGATCTGG 


GCATTAGGAC 


AGTGGCTCTT 


TGCGTATCTT 


CCACGGGTCT 


CCTAGACTCG 


2370 


CCCCACGGGA 


AACTCAAGGA 


GGCGGTGAAT 


TCGTAATCAG 


CAATATTGGC 


TGCGCCTACT 


2430 


CTTCTCTGTT 


GCACTAGGAA 


TTCTCTGCAT 


TCCTTACTTG 


CACTGTCGTC 


CTTAATCTTA 


2490 


AAGACCCGAC 


TTGCCAAAAC 


ATTGGCTGCC 


TACTTCACTG 


GCCTGTCTCT 


GGACAATAGG 


2550 


AATTCAATCT 


GGCGAAACAA 


AAATGTAATG 


TTGGACTTTG 


CTGCATTTTA 


CACACGTGCC 


2610 


GATGTTTACA 


ACGATGCAAA 


CATTAGGAAT 


TGTTTAGACA 


CAACTTTGCA 


AATTATTTAT 


2670 


TACTGGTGCA 


CTTAGCAGTT 


TTTGTTTTTT 


TTTGTTTTTT 


TGTTTTTTTT 


TTGTTTTGTT 


2730 


TTGTTTTTAT 


ATATAAAACT 


GCCTCGTGCG 


TATGTTAAAG 


CTTATTTTTA 


TGTGGTCTTA 


2790 


TGATTTTATT 


ACCGAAATGT 


TTTTAACACC 


CGATTCTGAA 


ATGGATGTTT 


TCTTTTATTA 


2850 


TCAGTTAAAT 


TCACATTTTA 


AATGCTTCAC 


TTTTTTTTTA 


TGTGTGTAGA 


CTGTAACTTT 


2910 


CTTTTCAGTT 


AGTATACAGA 


ACGTATTTAG 


CCATTACCCA 


TGCAACACCA 


CCCAATATAT 


2970 


TACTGATTTA 


GAAGCAAAGA 


TTTCAGTAGA 


ATTTTAGTCC 


CAAACGCTGT 


GGGGGGGAAA 


3030 
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TGCATCTTCT TCGGAACTAT CCATTACATG CATTTAAACT CTGCCAGAAA AAAAAATAAC 3090 

TATTTTGTTT TAATCTACTT TTTCTATTTA GTAGTTATTT GTATAAATTA AATAAACTGT 3150 

TTTCAAGTCA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAT 3210 

AAAAAAAAAA AAAGCGGCCG CAGAATTC 3238 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 509 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Val Asp Gly Ala Met He Leu Ser Val Leu Met 
15 10 

Pro Ser Pro Ser Met Glu Asp Glu Glu Pro Lys Val 
20 25 

Tyr Met Cys Val Cys Glu Gly Leu Ser Cys Gly Asn 
35 40 



Met Met Ala Leu 
15 

Asn Pro Lys Leu 
30 

Glu Asp His Cys 
45 



Glu Gly Gin Gin Cys Phe Ser Ser Leu Ser Val Asn Asp Gly Phe Arg 
. 50 55 60 



Val Tyr Gin Lys Gly Cys Phe Gin Val Tyr Glu Gin 
65 70 75 

Cys Lys Thr Pro Pro Ser Pro Gly Gin Ala Val Glu 
85 90 

Asp Trp Cys Asn Arg Asn Val Thr Ala Arg Leu Pro 
100 105 

Ser Phe Pro Gly Ser Gin Asn Phe His Leu Glu Val 
115 120 



Gly Lys Met Thr 
80 

Cys Cys Gin Gly 
95 

Thr Lys Gly Lys 
110 

Gly Leu He He 
125 



Leu Ser Val Val Phe Ala Val Cys Leu Phe Ala Cys He Leu Gly Val 
130 135 140 



Ala Leu Arg Lys Phe Lys Arg Arg Asn Gin Glu Arg 
145 150 155 

Asp Val Glu Tyr Gly Thr He Glu Gly Leu He Thr 
165 170 

Asp Ser Thr Leu Ala Glu Leu Leu Asp His Ser Cys 
180 185 

Gly Ser Gly Leu Pro Phe Leu Val Gin Arg Thr Val 
195 200 



Leu Asn Pro Arg 
160 

Thr Asn Val Gly 
175 

Thr Ser Gly Ser 
190 

Ala Arg Gin He 
205 



Thr Leu Leu Glu Cys Val Gly Lys Gly Arg Tyr Gly Glu Val Trp Arg 
210 215 220 
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Gly Ser Trp Gin Gly Glu Asn Val Ala Val Lye lie Phe Ser Ser Arg 
225 230 235 240 

Asp Glu Lye Ser Trp Phe Arg Glu Thr Glu Leu Tyr Asn Thr Val Met 
245 250 255 

Leu Arg His Glu Asn He Leu Gly Phe He Ala Ser Asp Met Thr Ser 
260 265 270 

Arg His Ser Ser Thr Gin Leu Trp Leu He Thr His Tyr His Glu Met 
275 280 285 

Gly Ser Leu Tyr Asp Tyr Leu Gin Leu Thr Thr Leu Asp Thr Val Ser 
290 295 300 

Cys Leu Arg He Val Leu Ser He Ala Ser Gly Leu Ala His Leu His 
305 310 315 320 

He Glu He Phe Gly Thr Gin Gly Lys Ser Ala He Ala His Arg Asp 
325 330 335 

Leu Lys Ser Lys Asn He Leu Val Lys Lys Asn Gly Gin Cys Cys He 
340 345 350 

Ala Asp Leu Gly Leu Ala Val Met His Ser Gin Ser Thr Asn Gin Leu 
355 360 365 

Asp Val Gly Asn Asn Pro Arg Val Gly Thr Lys Arg Tyr Met Ala Pro 
370 375 380 



Glu Val Leu Asp Glu Thr He Gin Val Asp Cys Phe Asp Ser Tyr Lys 
385 390 395 400 

Arg Val Asp lie Trp Ala Phe Gly Leu Val Leu Trp Glu Val Ala Arg 
405 410 415 

Arg Met Val Ser Asn Gly He Val Glu Asp Tyr Lys Pro Pro Phe Tyr 
420 425 430 

Asp Val Val Pro Asn Asp Pro Ser Phe Glu Asp Met Arg Lys Val Val 
435 440 445 

Cys Val Asp Gin Gin Arg Pro Asn He Pro Asn Arg Trp Phe Ser Asp 
450 455 460 

Pro Thr Leu Thr Ser Leu Ala Lys Leu Met Lys Glu Cys Trp Tyr Gin 
465 470 475 480 

Asn Pro Ser Ala Arg Leu Thr Ala Leu Arg He Lys Lys Thr Leu Thr 
485 490 495 

Lys He Asp Asn Ser Leu Asp Lys Leu Lys Thr Asp Cys 
500 505 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1647 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: W-101 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 80.. 1594 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

GAATCTGCGG CCGCGAG6GA GAGAGGCGCC GGGGGCGCGC GCGCGCGCTG GGCGCTGCTG 

GGCTGCGGCG GCGGTTACT ATG GCG GAG TCG GCC GGA GCC TCC TCC TTC TTC 

Met Ala Glu Ser Ala Gly Ala Ser Ser Phe Phe 
1 5 10 

CCC CTT GTT GTC CTC CTG CTC GCC GGC AGC GGC GGG TCC GGG CCC CGG 
Pro Leu Val Val Leu Leu Leu Ala Gly Ser Gly Gly Ser Gly Pro Arg 
15 20 25 

GGG ATC CAG GCT CTG CTG TGT GCG TGC ACC AGC TGC CTA CAG ACC AAC 
Gly lie Gin Ala Leu Leu Cye Ala Cys Thr Ser Cys Leu Gin Thr Asn 
30 35 40 

TAC ACC TGT GAG ACA GAT GGG GCT TGC ATG GTC TCC ATC TTT AAC CTG 
Tyr Thr Cye Glu Thr Asp Gly Ala Cys Met Val Ser lie Phe Asn Leu 
45 50 55 

GAT GGC GTG GAG CAC CAT GTA CGT ACC TGC ATC CCC AAG GTG GAG CTG 
Asp Gly Val Glu His Hie Val Arg Thr Cys He Pro Lys Val Glu Leu 
60 65 70 75 

GTT CCT GCT GGA AAG CCC TTC TAC TGC CTG AGT TCA GAG GAT CTG CGC 
Val Pro Ala Gly Lys Pro Phe Tyr Cys Leu Ser Ser Glu Asp Leu Arg 
80 85 90 

AAC ACA CAC TGC TGC TAT ATT GAC TTC TGC AAC AAG ATT GAC CTC AGG 
Asn Thr His Cys Cys Tyr He Asp Phe Cys Asn Lys He Asp Leu Arg 
95 100 105 

GTC CCC AGC GGA CAC CTC AAG GAG CCT GCG CAC CCC TCC ATG TGG GGC 
Val Pro Ser Gly His Leu Lys Glu Pro Ala His Pro Ser Met Trp Gly 
110 115 120 

CCT GTG GAG CTG GTC GGC ATC ATC GCC GGC CCC GTC TTC CTC CTC TTC 
Pro Val Glu Leu Val Gly He He Ala Gly Pro Val Phe Leu Leu Phe 
125 130 135 

CTT ATC ATT ATC ATC GTC TTC CTG GTC ATC AAC TAT CAC CAG CGT GTC 
Leu He He He He Val Phe Leu Val He Asn Tyr His Gin Arg Val 
140 145 150 155 

TAC CAT AAC CGC CAG AGG TTG GAC ATG GAG GAC CCC TCT TGC GAG ATG 
Tyr His Asn Arg Gin Arg Leu Asp Met Glu Asp Pro Ser Cys Glu Met 
160 " 165 170 
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TGT CTC TCC AAA GAC AAG ACG CTC CAG GAT CTC GTC TAC GAC CTC TCC 640 
Cys Leu Ser Lye Asp Lye Thr Leu Gin Asp Leu Val Tyr Asp Leu Ser 
175 180 185 

ACG TCA GGG TCT GGC TCA GGG TTA CCC CTT TTT GTC CAG CGC ACA GTG 688 
Thr Ser Gly Ser Gly Ser Gly Leu Pro Leu Phe Val Gin Arg Thr Val 
190 195 200 

GCC CGA ACC ATT GTT TTA CAA GAG ATT ATC GGC AAG GGC CGG TTC GGG 736 
Ala Arg Thr He Val Leu Gin Glu He He Gly Lye Gly Arg Phe Gly 
205 210 215 

GAA GTA TGG CGT GGT CGC TGG AGG GGT GGT GAC GTG GCT GTG AAA ATC 784 
Glu Val Trp Arg Gly Arg Trp Arg Gly Gly Asp Val Ala Val Lys He 
220 ~ 225 230 235 

TTC TCT TCT CGT GAA GAA CGG TCT TGG TTC CGT GAA GCA GAG ATC TAC 832 
Phe Ser Ser Arg Glu Glu Arg Ser Trp Phe Arg Glu Ala Glu He Tyr 
240 245 250 

CAG ACC GTC ATG CTG CGC CAT GAA AAC ATC CTT GGC TTT ATT GCT GCT 880 
Gin Thr Val Met Leu Arg Hie Glu Asn He Leu Gly Phe He Ala Ala 
255 260 265 

GAC AAT AAA GAT AAT GGC ACC TGG ACC CAG CTG TGG CTT GTC TCT GAC 928 
Asp Asn Lys Asp Asn Gly Thr Trp Thr Gin Leu Trp Leu Val Ser Asp 
270 275 280 

TAT CAC GAG CAT GGC TCA CTG TTT GAT TAT CTG AAC CGC TAC ACC GTG 976 
Tyr His Glu His Gly Ser Leu Phe Asp Tyr Leu Asn Arg Tyr Thr Val 
285 290 295 

ACC ATT GAG GGC ATG ATT AAG CTA GCC TTG TCT GCA GCC AGT GGT TTG 1024 
Thr He Glu Gly Met He Lys Leu Ala Leu Ser Ala Ala Ser Gly Leu 
300 305 310 315 

GCA CAC CTG CAT ATG GAG ATT GTG GGC ACT CAA GGG AAG CCG GGA ATT 1072 
Ala His Leu His Met Glu He Val Gly Thr Gin Gly Lys Pro Gly He 
320 325 330 

GCT CAT CGA GAC TTG AAG TCA AAG AAC ATC CTG GTG AAA AAA AAT GGC 1120 
Ala His Arg Asp Leu Lys Ser Lys Asn He Leu Val Lys Lys Asn Gly 
335 340 345 

ATG TGT GCC ATT GCA GAC CTG GGC CTG GCT GTC CGT CAT GAT GCG GTC 1168 
Met Cys Ala He Ala Asp Leu Gly Leu Ala Val Arg His Asp Ala Val 
350 355 360 

ACT GAC ACC ATA GAC ATT GCT CCA AAT CAG AGG GTG GGG ACC AAA CGA 1216 
Thr Asp Thr He Asp He Ala Pro Asn Gin Arg Val Gly Thr Lys Arg 
365 370 375 

TAC ATG GCT CCT GAA GTC CTT GAC GAG ACA ATC AAC ATG AAG CAC TTT 1264 
Tyr Met Ala Pro Glu Val Leu Asp Glu Thr lie Asn Met Lys His Phe 
380 385 390 395 

GAC TCC TTC AAA TGT GCC GAC ATC TAT GCC CTC GGG CTT GTC TAC TGG 1312 
Asp Ser Phe Lys Cys Ala Asp He Tyr Ala Leu Gly Leu Val Tyr Trp 
400 405 410 
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GAG ATT GCA CGA AGA TGC AAT TCT GGA GGA GTC CAT GAA GAC TAT CAA 
Glu lie Ala Arg Arg Cys Asn Ser Gly Gly Val Hie Glu Asp Tyr Gin 
415 " 420 425 



1360 



CTG CCG TAT TAC GAC TTA GTG CCC TCC GAC CCT TCC ATT GAG GAG ATG 
Leu Pro Tyr Tyr Asp Leu Val Pro Ser Asp Pro Ser lie Glu Glu Met 
430 435 440 



1408 



CGA AAG GTT GTA TGT GAC CAG AAG CTA CGG CCC AAT GTC CCC AAC TGG 
Arg Lye Val Val Cys Asp Gin Lys Leu Arg Pro Asn Val Pro Asn Trp 
445 450 455 



1456 



TGG CAG AGT TAT GAG GCC TTG CGA GTG ATG GGA AAG ATG ATG CGG GAG 
Trp Gin Ser Tyr Glu Ala Leu Arg Val Met Gly Lys Met Met Arg Glu 
460 465 470 475 



1504 



TGC TGG TAC GCC AAT GGT GCT GCC CGT CTG ACA GCT CTG CGC ATC AAG 
Cys Trp Tyr Ala Asn Gly Ala Ala Arg Leu Thr Ala Leu Arg lie Lys 
480 485 490 



1552 



AAG ACT CTG TCC CAG CTA AGC GTG CAG GAA GAT GTG AAG ATT 
Lys Thr Leu Ser Gin Leu Ser Val Gin Glu Asp Val Lys lie 
495 500 505 



1594 



TAAGCTGTTA AGATGCCTAC ACAAAGAACC TGGGCAGTGA GGATGACTGC AGG 



1647 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 505 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Ala Glu Ser Ala Gly Ala Ser Ser Phe Phe Pro 
15 10 

Leu Leu Ala Gly Ser Gly Gly Ser Gly Pro Arg Gly 
20 25 

Leu Cys Ala Cys Thr Ser Cys Leu Gin Thr Asn Tyr 
35 40 



Leu Val Val Leu 
15 

lie Gin Ala Leu 
30 

Thr Cys Glu Thr 
45 



Asp Gly Ala Cys Met Val Ser He Phe Asn Leu Asp Gly Val Glu His 
50 55 60 



His Val Arg Thr Cys He Pro Lys Val Glu Leu Val 
65 70 " 75 

Pro Phe Tyr Cys Leu Ser Ser Glu Asp Leu Arg Asn 
85 90 

Tyr He Asp Phe Cys Asn Lys He Asp Leu Arg Val 
100 105 

Leu Lys Glu Pro Ala His Pro Ser Met Trp Gly Pro 
115 120 



Pro Ala Gly Lys 
80 

Thr His Cys Cys 
95 

Pro Ser Gly His 
110 

Val Glu Leu Val 
125 
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Gly lie lie Ala Gly Pro Val Phe Leu Leu Phe Leu lie 
130 135 140 

Val Phe Leu Val He Asn Tyr His Gin Arg Val Tyr Hie 
145 150 155 

Arg Leu Asp Met Glu Asp Pro Ser Cys Glu Met Cys Leu 
165 170 

Lys Thr Leu Gin Asp Leu Val Tyr Asp Leu Ser Thr Ser 
180 185 

Ser Gly Leu Pro Leu Phe Val Gin Arg Thr Val Ala Arg 
195 200 205 

Leu Gin Glu He He Gly Lys Gly Arg Phe Gly Glu Val 
210 215 220 

Arg Trp Arg Gly Gly Asp Val Ala Val Lys He Phe Ser 
225 230 235 

Glu Arg Ser Trp Phe Arg Glu Ala Glu He Tyr Gin Thr 
245 250 

Arg His Glu Asn He Leu Gly Phe He Ala Ala Asp Asn 
260 265 

Gly Thr Trp Thr Gin Leu Trp Leu Val Ser Asp Tyr His 
275 280 285 

Ser Leu Phe Asp Tyr Leu Asn Arg Tyr Thr Val Thr He 
290 * 295 300 

He Lys Leu Ala Leu Ser Ala Ala Ser Gly Leu Ala His 
305 310 315 

Glu He Val Gly Thr Gin Gly Lys Pro Gly He Ala His 
325 330 

Lys Ser Lys Asn He Leu Val Lys Lys Asn Gly Met Cys 
340 345 

Asp Leu Gly Leu Ala Val Arg His Asp Ala Val Thr Asp 
355 360 365 

He Ala Pro Asn Gin Arg Val Gly Thr Lys Arg Tyr Met 
370 375 380 

Val Leu Asp Glu Thr He Asn Met Lys His Phe Asp Ser 
385 390 395 

Ala Asp lie Tyr Ala Leu Gly Leu Val Tyr Trp Glu He 
405 410 

Cys Asn Ser Gly Gly Val His Glu Asp Tyr Gin Leu Pro 
420 425 

Leu Val Pro Ser Asp Pro Ser He Glu Glu Met Arg Lys 
435 440 445 

Asp Gin Lys Leu Arg Pro Asn Val Pro Asn Trp Trp Gin 
450 455 460 



He He He 

Asn Arg Gin 
160 

Ser Lys Asp 
175 

Gly Ser Gly 
190 

Thr He Val 

Trp Arg Gly 

Ser Arg Glu 
240 

Val Met Leu 
255 

Lys Asp Asn 
270 

Glu His Gly 

Glu Gly Met 

Leu His Met 
320 

Arg Asp Leu 

335 

Ala He Ala 
350 

Thr He Asp 

Ala Pro Glu 

Phe Lys Cys 
400 

Ala Arg Arg 
415 

Tyr Tyr Asp 
430 

Val Val Cys 
Ser Tyr Glu 
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Ala Leu Arg Val Met Gly Lye Met Met Arg Glu Cys Trp Tyr Ala Asn 
465 470 475 480 

Gly Ala Ala Arg Leu Thr Ala Leu Arg He Lys Lys Thr Leu Ser Gin 
485 490 495 

Leu Ser Val Gin Glu Asp Val Lys He 
500 505 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1794 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: W-120 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 83. • 1591 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GAATTCGCGG CCGCGGGCGA GGCTTCCTGA GGAGAAGCTG CGGCCGGGGC CGGGCCGGGC 60 

CACAAACAGT GGCGGCGGGA CC ATG GAG GCG GCG GCC GCT GCT CCA CGT CGT 112 

Met Glu Ala Ala Ala Ala Ala Pro Arg Arg 
15 10 

CCG CAG CTC CTC ATC GTG TTG GTG GCG GCG GCG ACG CTG CTC CCG GGG 160 
Pro Gin Leu Leu He Val Leu Val Ala Ala Ala Thr Leu Leu Pro Gly 
15 20 25 

GCG AAG GCA TTA CAG TGT TTC TGC CAC CTC TGT ACA AAG GAT AAT TTT 208 
Ala Lys Ala Leu Gin Cys Phe Cys His Leu Cys Thr Lys Asp Asn Phe 
30 35 40 

ACC TGT GAG ACA GAT GGT CTT TGC TTT GTC TCA GTC ACT GAG ACC ACA 256 
Thr Cys Glu Thr Asp Gly Leu Cys Phe Val Ser Val Thr Glu Thr Thr 
45 * 50 55 

GAC AAA GTT ATA CAC AAT AGT ATG TGT ATA GCT GAA ATT GAC CTA ATT 304 
Asp Lys Val He His Asn Ser Met Cys He Ala Glu He Asp Leu He 
60 65 70 

CCT CGA GAC AGG CCA TTT GTA TGT GCA CCA TCT TCA AAA ACA GGG GCA 352 
Pro Arg Asp Arg Pro Phe Val Cys Ala Pro Ser Ser Lys Thr Gly Ala 
75 80 85 90 

GTT ACT ACA ACA TAT TGC TGC AAT CAG GAC CAC TGC AAT AAA ATA GAA 400 
Val Thr Thr Thr Tyr Cys Cys Asn Gin Asp His Cys Asn Lys He Glu 
95 " 100 105 

CTC CCA ACT ACA GGA CCT TTT TCA GAA AAG CAG TCA GCT GGC CTT GGT 448 
Leu Pro Thr Thr Gly Pro Phe Ser Glu Lys Gin Ser Ala Gly Leu Gly 
110 " 115 120 
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CCT CTG GAG CTG GCA GCT GTC ATT GCT GGT CCA GTC TGC TTC GTC TGC 496 
Pro Val Glu Leu Ala Ala Val He Ala Gly Pro Val Cys Phe Val Cys 
125 130 135 

ATT GCA CTT ATG CTG ATG GTC TAT ATC TGC CAT AAC CGC ACT GTC ATT 544 
He Ala Leu Met Leu Met Val Tyr He Cys His Asn Arg Thr Val He 
140 145 150 

CAC CAC CGT GTG CCA AAT GAA GAG GAT CCA TCA CTA GAT CGC CCT TTC 592 
His His Arg Val Pro Asn Glu Glu Asp Pro Ser Leu Asp Arg Pro Phe 
155 160 165 170 

ATT TCA GAG GGC ACC ACC TTA AAA GAT TTA ATT TAT GAT ATG ACA ACA 640 
He Ser Glu Gly Thr Thr Leu Lys Asp Leu He Tyr Asp Met Thr Thr 
175 180 185 

TCA GGG TCT GGA TCA GGT TTA CCA CTG CTT GTT CAA AGA ACA ATT GCC 688 
Ser Gly Ser Gly Ser Gly Leu Pro Leu Leu Val Gin Arg Thr He Ala 
190 195 200 

AGG ACC ATT GTG TTA CAA GAA AGC ATT GGC AAA GGT CGG TTT GGA GAA 736 
Arg Thr He Val Leu Gin Glu Ser He Gly Lys Gly Arg Phe Gly Glu 
205 210 215 

GTT TGG CGA GGC AAA TGG CGG GGA GAA GAA GTT GCT GTG AAG ATA TTC 784 
Val Trp Arg Gly Lys Trp Arg Gly Glu Glu Val Ala Val Lys He Phe 
220 225 230 

TCT TCT AGA GAA GAG CGT TCA TGG TTC CGA GAG GCA GAG ATT TAT CAG 832 
Ser Ser Arg Glu Glu Arg Ser Trp Phe Arg Glu Ala Glu He Tyr Gin 
235 240 245 250 

ACT GTA ATG TTA CGC CAT GAA AAT ATC CTG GGA TTT ATA GCA GCA GAC 880 
Thr Val Met Leu Arg His Glu Asn He Leu Gly Phe He Ala Ala Asp 
255 260 265 

AAC AAA GAC AAT GGG ACA TGG ACG CAG CTG TGG TTG GTG TCA GAT TAT 928 
Asn Lys Asp Asn Gly Thr Trp Thr Gin Leu Trp Leu Val Ser Asp Tyr 
270 275 280 

CAT GAG CAT GGA TCC CTT TTC GAT TAC TTG AAT AGA TAC ACT GTT ACT 976 
His Glu His Gly Ser Leu Phe Asp Tyr Leu Asn Arg Tyr Thr Val Thr 
285 "* 290 295 

GTG GAA GGA ATG ATC AAG CTT GCT CTG TCC ACA GCA AGT GGT CTT GCC 1024 
Val Glu Gly Met He Lye Leu Ala Leu Ser Thr Ala Ser Gly Leu Ala 
300 " 305 310 

CAT CTT CAC ATG GAG ATT GTT GGT ACC CAA GGA AAA CCA GCT ATT GCC 1072 
His Leu His Met Glu He Val Gly Thr Gin Gly Lys Pro Ala He Ala 
315 320 325 330 

CAT AGA GAT TTG AAA TCA AAG AAT ATC TTG GTG AAG AAA AAT GGA ACC 1120 
His Arg Asp Leu Lys Ser Lys Asn He Leu Val Lys Lys Asn Gly Thr 
335 * 340 345 

TGT TGT ATT GCA GAC TTG GGA CTT GCT GTG AGA CAT GAT TCT GCC ACA 1168 
Cys Cys He Ala Asp Leu Gly Leu Ala Val Arg His Asp Ser Ala Thr 
350 355 360 
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GAT ACA ATT GAT ATT GCT CCA AAC CAC AGA GTA GGC ACT AAA AGG TAC 1216 

Asp Thr lie Asp lie Ala Pro Asn His Arg Val Gly Thr Lys Arg Tyr 
365 370 375 

ATG GCC CCT GAA GTT CTA GAT GAT TCC ATA AAT ATG AAA CAT TTT GAA 1264 

Met Ala Pro Glu Val Leu Asp Asp Ser lie Asn Met Lys His Phe Glu 
380 385 390 

TCC TTC AAA CGC GCT GAC ATC TAT GCA ATG GGC TTA GTG TTC TGG GAA 1312 

Ser Phe Lys Arg Ala Asp lie Tyr Ala Met Gly Leu Val Phe Trp Glu 

395 400 405 410 

ATT GCT CGA CGC TGT TCT ATT GGT GGA ATC CAT GAA GAC TAT CAG TTG 1360 

lie Ala Arg Arg Cys Ser lie Gly Gly lie His Glu Asp Tyr Gin Leu 
415 420 425 



CCT TAT TAT GAT CTT GTA CCT TCT GAT CCA TCG GTT GAA GAA ATG AGA 1408 
Pro Tyr Tyr Asp Leu Val Pro Ser Asp Pro Ser Val Glu Glu Met Arg 
430 435 440 

AAA GTA GTT TGC GAA CAG AAG TTA AGG CCA AAT ATT CCA AAC AGA TGG 1456 
Lys Val Val Cys Glu Gin Lys Leu Arg Pro Asn lie Pro Asn Arg Trp 
445 450 455 

CAG AGC TGT GAG GCC TTG AGA GTG ATG GCT AAA ATT ATG AGA GAA TGC 1504 
Gin Ser Cys Glu Ala Leu Arg Val Met Ala Lys He Met Arg Glu Cys 
460 465 470 

TGG TAT GCC AAT GGA GCA GCA AGG CTG ACA GCT TTG CGA ATT AAA AAA 1552 
Trp Tyr Ala Asn Gly Ala Ala Arg Leu Thr Ala Leu Arg He Lys Lys 
475 480 485 490 

ACA TTG TCA CAA CTC AGC CAA CAG GAA GGC ATC AAA ATG TAACTGAAAC 1601 
Thr Leu Ser Gin Leu Ser Gin Gin Glu Gly He Lys Met 
495 500 

ACCGTGGGAA CTCTGCTCTC TTCATATCTG CTCCTGGGTG TTT AGG AGG C TGGTTGTTCT 1661 

ACCTCACTGA GAGAACAGAG GGCTCTGCTT CCTCTTGCAG CAGTGGAATA TGGTCAACTG 1721 

AAAGCTTCCC AGGGTTTCTC TGGGCCCAGA GGCAGCCGTG GGGTCCTTCT GTGCACTATG 1781 

GATAACTTCT TCC 1794 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 503 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Glu Ala Ala Ala Ala Ala Pro Arg Arg Pro Gin Leu Leu He Val 
15 10 15 

Leu Val Ala Ala Ala Thr Leu Leu Pro Gly Ala Lys Ala Leu Gin Cys 
20 25 30 
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Phe Cyfl HiB Leu Cye Thr Lys Asp Asn Phe Thr Cys Glu Thr Asp Gly 
35 40 45 

Leu Cys Phe Val Ser Val Thr Glu Thr Thr Asp Lys Val He His Asn 
50 55 60 

Ser Met Cys He Ala Glu He Asp Leu He Pro Arg Asp Arg Pro Phe 
65 70 75 80 

Val Cye Ala Pro Ser Ser Lys Thr Gly Ala Val Thr Thr Thr Tyr Cys 
85 90 95 

Cys Asn Gin Asp His Cys Asn Lys He Glu Leu Pro Thr Thr Gly Pro 
100 105 110 

Phe Ser Glu Lys Gin Ser Ala Gly Leu Gly Pro Val Glu Leu Ala Ala 
115 120 125 

Val He Ala Gly Pro Val Cys Phe Val Cys He Ala Leu Met Leu Met 
130 135 140 

Val Tyr He Cys His Asn Arg Thr Val He His His Arg Val Pro Asn 
145 150 155 160 

Glu Glu ABp Pro Ser Leu Asp Arg Pro Phe He Ser Glu Gly Thr Thr 
165 170 175 

Leu Lys Asp Leu He Tyr Asp Met Thr Thr Ser Gly Ser Gly Ser Gly 
180 185 190 

Leu Pro Leu Leu Val Gin Arg Thr He Ala Arg Thr He Val Leu Gin 
195 200 205 

Glu Ser He Gly Lys Gly Arg Phe Gly Glu Val Trp Arg Gly Lys Trp 
210 215 220 

Arg Gly Glu Glu Val Ala Val Lys He Phe Ser Ser Arg Glu Glu Arg 
225 230 235 240 

Ser Trp Phe Arg Glu Ala Glu He Tyr Gin Thr Val Met Leu Arg His 
245 250 255 

Glu Asn He Leu Gly Phe He Ala Ala Asp Asn Lys Asp Asn Gly Thr 
260 265 270 



Trp Thr Gin Leu Trp Leu Val Ser Asp Tyr His Glu His Gly Ser Leu 

275 280 285 

Phe Asp Tyr Leu Asn Arg Tyr Thr Val Thr Val Glu Gly Met He Lys 

290 295 300 

Leu Ala Leu Ser Thr Ala Ser Gly Leu Ala His Leu His Met Glu He 
305 310 315 320 

Val Gly Thr Gin Gly Lys Pro Ala He Ala His Arg Asp Leu Lys Ser 
325 330 335 

Lys Asn He Leu Val Lys Lys Asn Gly Thr Cys Cys He Ala Asp Leu 

340 * " 345 350 

Gly Leu Ala Val Arg His Asp Ser Ala Thr Asp Thr He Asp He Ala 

355 360 365 



69 



Pro Asn Hie Arg Val Gly Thr Lys 
370 375 

% Asp Aep Ser lie Asn Met Lys His 
I 385 390 

*' lie Tyr Ala Met Gly Leu Val Phe 

405 

lie Gly Gly lie His Glu Asp Tyr 
420 

Pro Ser Asp Pro Ser Val Glu Glu 
435 440 

Lys Leu Arg Pro Asn lie Pro Asn 
450 455 

Arg Val Met Ala Lys lie Met Arg 
465 470 

Ala Arg Leu Thr Ala Leu Arg lie 
485 

Gin Gin Glu Gly lie Lys Met 
500 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 341 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: KDA-B5 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 25.. 318 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GGATCCGAAT ACGTGGCGGT TAAA ATA TTC TCC TCC AGG GAT GAG AGA TCT 51 

lie Phe Ser Ser Arg Asp Glu Arg Ser 
1 5 

TGG TTC CGT GAG GCG GAA ATT TAT CAG ACG GTG ATG CTG AGA CAC GAG 99 
Trp Phe Arg Glu Ala Glu He Tyr Gin Thr Val Met Leu Arg His Glu 
10 15 20 25 

AAC ATC CTC GGT TTC ATC GCA GCT GAC AAC AAA GAT AAT GGA ACT TGG 147 
Asn He Leu Gly Phe He Ala Ala Asp Asn Lys Asp Asn Gly Thr Trp 
30 35 40 

ACA CAA CTC TGG CTT GTG TCA GAG TAT CAC GAG CAG GGC TCC TTG TAT 195 
Thr Gin Leu Trp Leu Val Ser Glu Tyr His Glu Gin Gly Ser Leu Tyr 
45 50 55 



Arg Tyr 

Phe Glu 

Trp Glu 
410 

Gin Leu 
425 

Met Arg 
Arg Trp 
Glu Cys 



Lys Lys 
490 



Met Ala Pro 
380 

Ser Phe Lys 
395 

He Ala Arg 
Pro Tyr Tyr 



Lys Val Val 
445 

Gin Ser Cys 
460 

Trp Tyr Ala 
475 

Thr Leu Ser 



Glu Val Leu 

Arg Ala Asp 
400 

Arg Cys Ser 
415 

Asp Leu Val 
430 

Cys Glu Gin 
Glu Ala Leu 



Asn Gly Ala 
480 

Gin Leu Ser 
495 
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GAC TAT TTG AAC AGA AAC ATA GTG ACT GTG GCT GGA ATG GTC AAG CTG 243 
Asp Tyr Leu Asn Arg Asn lie Val Thr Val Ala Gly Met Val Lys Leu 
60 65 70 

GCG CTT TCC ATA GCG AGT GGT CTG GCT CAC CTG CAC ATG GAG ATC GTG 291 
Ala Leu Ser He Ala Ser Gly Leu Ala Hie Leu His Met Glu He Val 
75 80 85 

GGT ACT CAA GGT AAG CTT GCT ATT GCT CACGGTGATA TCAAAAGTCT 338 
Gly Thr Gin Gly Lye Leu Ala He Ala 
90 95 

AGA 341 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

He Phe Ser Ser Arg Asp Glu Arg Ser Trp Phe Arg Glu Ala Glu He 
15 10 15 

Tyr Gin Thr Val Met Leu Arg His Glu Asn He Leu Gly Phe He Ala 
20 25 30 

Ala Asp Asn Lys Asp Asn Gly Thr Trp Thr Gin Leu Trp Leu Val Ser 
35 40 45 

Glu Tyr His Glu Gin Gly Ser Leu Tyr Asp Tyr Leu Asn Arg Asn He 
50 55 60 

Val Thr Val Ala Gly Met Val Lys Leu Ala Leu Ser He Ala Ser Gly 
65 70 75 80 

Leu Ala His Leu His Met Glu He Val Gly Thr Gin Gly Lys Leu Ala 
85 90 95 

He Ala 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: PRIMER A 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CCGGATCCGA RTAYGTNGCN GTNAAR 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: PRIMER B 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GACTGTAGAR CTYTTDATRT CYCTRTG 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: PRIMER C 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GACTCTAGAR CTYTTDATRT CNCGRTG 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: PRIMER D 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GACTCTAGNG AYTTDATRTC YCTRTG 
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(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: PRIMER E 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GACTCTAGAN GAYTTDATRT CNCGRTG 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: PEPTIDE SEQUENCE OF KDA-B5 USED TO DESIGN 
PRIMER A 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Asn Glu Tyr Val Ala Val Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: PEPTIDE SEQUENCE OF KDA-B5 USED TO DESIGN 
PRIMERS B THRU E 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

His Arg Asp lie Lys Ser 
1 5 
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